🔲 ML Hardware - nickyfoto · Scour

DeepSeek-V4-Flash Benchmarks, FlashRT CUDA Runtime, & V100 LLM Performance ⚡Performance Engineering

dev.to·1d·DEV

SMG: The Case for Disaggregating CPU from GPU in LLM Serving ⚡Performance Engineering

pytorch.org·6d·Hacker News

Not All Thoughts Need HBM: Semantics-Aware Memory Hierarchy for LLM Reasoning 🧠LLMs

CUDA Proves Nvidia Is a Software Company ⚡Performance Engineering

hardware.slashdot.org·21h

In a quest to becoming AI-independent 🤖AI Research

adlrocha.substack.com·2d·Substack

Google New TPU Generation is Specifically Designed for Agents and SOTA Model Training 🤖AI Research

CPU GPU Combo - AMD RYZEN 7 9850X3D + GIGABYTE 5070ti GV-N507TEAGLE OC-16GD $1265 at Newegg ⚡Performance Engineering

forums.anandtech.com·19h

$200 'socketed' Nvidia AI GPU for servers hacked into a PCIe card with custom PCB and 3D-printed cooling — modded Tesla V100 SMX data center GPU runs AI LLMs and is more efficient than many modern midrange offerings in AI inference 🔌Embedded Systems

tomshardware.com

·2d·r/LLM

Secure short-term GPU capacity for ML workloads with EC2 Capacity Blocks for ML and SageMaker training plans ☁️Cloud Computing

aws.amazon.com·5d

I'm going back to writing code by hand 🛠️Software Craft

blog.k10s.dev·1d·Lobsters, Hacker News

AI: Apologies, I was only doing as instructed. (What Hollow is and isn't) 🕵️AI Agents

ninjahawk.github.io·1d·Hacker News

Getting a Proprietary-Bus GPU onto PCIe Enables Cheaper Local LLMs, For Now ⚡Performance Engineering

hackaday.com·3d

Merserk/comfyui-ubuntu-universal-installer: One-click Linux and WSL2 installer for ComfyUI with auto NVIDIA CUDA / AMD ROCm GPU detection, latest PyTorch setup, and launch script creation. 💻Command Line Tools

github.com·1d·r/StableDiffusion

AMD puts out new slottable GPU for AI-curious enterprises ⚡Performance Engineering

theregister.com·5d·r/LocalLLaMA

The Powerful Lenovo Legion RTX 5090 Gaming PC Drops to the Lowest Price of the Year, Also Includes 64GB of DDR5 RAM and a 2TB SSD 🔌Embedded Systems

The cuda-oxide Book ⚡Performance Engineering

nvlabs.github.io·4d·Lobsters, Hacker News, Hacker News, r/rust

PC still keeps crashing for an unknown reason ⚡Performance Engineering

techcommunity.microsoft.com·1d

The Great Pivot in Mining: Inside the Miner-to-GPU Migration 🔌Embedded Systems

hackernoon.com·4d

Long video generation blog: How We Shipped SVI in Production ⚡Performance Engineering

atlascloud.ai·5d·DEV

Practical Gemma 4 Benchmarking with LM Studio ⚡Performance Engineering

dev.to·1h·DEV

Log in to enable infinite scrolling